Amendments to the Claims 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1 . (currently amended) An apparatus within a pipelined microprocessor for 

forwarding store instruction results to a pipeline stage for execution of a load 
instruction, the apparatus comprising: 

a result forwarding cache (RFC), for storing at least one non-store instruction 
result destined for a user-visible register of the microprocessor, and for 
storing a plurality of store instruction results destined for a data cache of 
the microprocessor : 

comparison logic, for comparing a load address of the load instruction with a 
plurality of store addresses associated with said plurality of store 
instruction results to generate an address match signal; and 

control logic, configured to receive said match signal and selectively forward one 
of said plurality of store instruction results fi*om said RFC to the pipeline 
stage in response to said match signal , thereby avoiding stalling the load 
instruction until said one of said plurality of store instruction results is 
updated into said data cache . 

2. (original) The apparatus of claim 1, wherein said plurality of store instruction results 

comprise data to be stored from the microprocessor into a memory attached 
thereto. 

3. (original) The apparatus of claim 1, wherein said load address specifies a location of 

data to be loaded into the microprocessor fi-om a memory attached thereto. 

4. (original) The apparatus of claim 1, wherein said RFC comprises a plurality of 

storage elements for storing a predetermined number of instruction results. 
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5. (original) The apparatus of claim 4, wherein said instruction results are received by 
said RFC from an execution unit of the microprocessor. 



6. (original) The apparatus of claim 4, wherein said plurality of storage elements store 

said predetermined number of instruction results in a first-in-first-out manner. 

7. (original) The apparatus of claim 4, wherein said predetermined number of 

instruction results is five. 

8. (original) The apparatus of claim 1, wherein said load address and said plurality of 

store addresses comprise virtual addresses. 

9. (original) The apparatus of claim 8, wherein said virtual addresses comprise x86 

linear addresses. 

10. (currently amended) An apparatus for forwarding storehit data within stages of a 

pipelined microprocessor, the apparatus comprising: 

a result forwarding cache (RFC), configured to forward at least one non-store 
instruction result, and to store and forward a first plurality of store 
instruction results destined for a data cache of the microprocessor : 

a data unit, configured to forward a second plurality of store instruction results; 
and 

selection logic, coupled to said RFC and said data unit, for selectively providing 
one of said first and second plurality of store instruction results to a stage 
of the microprocessor pipeline executing a load instruction. 

11. (original) The apparatus of claim 10, wherein said load instruction comprises a load 

address for specifying an address of data to be loaded into the microprocessor, 
wherein said selection logic is configured to forward one of said first and second 
plurality of store instruction results only if said load address matches one or more 
of a first and second plurality of store addresses corresponding to said first and 
second plurality of store instruction results. 
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12. (original) The apparatus of claim 11, wherein selection logic forwards said first 

plurality of store instruction results forwarded by said RFC at a higher priority 
than said second plurality of store instructions results forwarded by said data unit 
if said load address matches both one or more of said first plurality of store 
addresses and one or more of said second plurality of store addresses. 

13. (original) The apparatus of claim 1 1, further comprising: 

comparison logic, coupled to said selection logic, for comparing said load address 
with said first and second plurality of store addresses to determine whether 
said load address matches one or more of said first and second plurality of 
store addresses. 

14. (original) The apparatus of claim 11, wherein said data unit is configured to forward 

said second plurality of store instruction results fi"om a plurality of store buffers of 
the microprocessor. 

15. (original) The apparatus of claim 14, wherein said plurality of store buffers is 

configured to store said second plurality of store instruction results while said 
second plurality of store instruction results are written to a memory coupled to the 
microprocessor. 

16. (original) The apparatus of claim 14, wherein said data unit is configured to forward 

a newest one of said second plurality of store instruction results if said load 
address matches more than one of said second plurality of store addresses. 

17. (original) The apparatus of claim 11, wherein said RFC is configured to forward a 

newest one of said first plurality of store instruction results if said load address 
matches more than one of said first plurality of store addresses. 
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18. (previously presented) An apparatus for detecting storehit conditions in a pipelined 

microprocessor in a hierarchical manner, the apparatus comprising: 

first comparison logic, for comparing a load instruction load address in a first 
stage of the pipeline with a first plurality of store addresses of first store 
instruction data in a plurality of stages of the pipeline subsequent to said 
first pipeline stage, wherein said plurality of stages of the pipeline 
subsequent to said first pipeline stage comprise non-store buffers; 

second comparison logic, for comparing said load address with a second plurality 
of store addresses of second store instruction data in a plurality of store 
buffers of the microprocessor; and 

control logic, coupled to said first and second comparison logic, configured to 
determine which of said first and second store instruction data is newest 
based on said first and second comparison logic comparing, and to 
forward said newest store instruction data to said first pipeline stage in 
response thereto. 

19. (original) The apparatus of claim 18, wherein said first comparison logic is 

configured to compare virtual addresses. 

20. (original) The apparatus of claim 1 8, wherein said second comparison logic is 

configured to compare physical addresses. 
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21. (currently amended) An apparatus for speculatively forwarding storehit data in a 
microprocessor pipeline, the apparatus comprising: 

a plurality of first and second virtual address comparators, for comparing a virtual 
load address with a plurality of first and second virtual store addresses to 
generate a virtual match signal for indicating whether first and second 
storehit data is likely present in a store buffer and a result forwarding 
cache, respectively, of the microprocessor, wherein if said first and second 
storehit data are both present said second storehit data is newer than said 
first storehit data : 

a plurality of first and second p hysical address comparators, for comparing a 
physical load address translated from said virtual load address with a 
pluralit>^ of first and second p hysical store addresses translated from said 
plurality of virtual store addresses to generate a physical match signal for 
indicating whether said first and second storehit data is certainly present in 
said store buffer and said result forwarding cache, respectively ; 

forwarding logic, coupled to receive said virtual match signal, for forwarding the 
said second storehit data present in said store buffer in response to said 
virtual match signal indicating no match between said virtual load address 
and said plurality of second virtual store address[[es] ] but a match 
between said virtual load address and said first virtual store address, prior 
to generation of said physical match signal; and 

control logic, for receiving said virtual and physical match signals and generating 
a stall signal for stalling the pipeline subsequent to said forwarding logic 
forwarding said storehit data from said store buffer if said physical match 
signal indicates a match between said physical load address and one of 
said plurality of second p hysical store address[[es]] bu ^although said 
virtual match signal previously indicated indicates n o match between said 
virtual load address and one of said plurality of second virtual store 
address|"[esn , until correct data specified by said physical load address is 
provided to replace said previously forwarded second storehit data . 



22. (currently amended) The apparatus of claim 21, further comprising: 

a data unit, configured to forward said correct data specified by the -said physical 
load address to replace said p reviously forwarded second storehit data; 

wherein said control logic is configured to deassert said stall signal after said data 
unit forwards said correct data. 

23. (previously presented) A pipelined microprocessor for speculatively forwarding 

storehit data from a first pipeline stage to a second pipeline stage, wherein the 
storehit data is specified by a load address in the second stage, comprising: 
address region logic, configured to receive the load address and generate a match 
signal to indicate whether the load address is within one of a plurality of 
non-cacheable address regions of the microprocessor address space stored 
therein; 

forwarding logic, for forwarding the storehit data from the first stage to the 
second stage prior to said address region logic generating said match 
signal; 

control logic, configured to receive said match signal and to assert a stall signal, 
subsequent to said forwarding logic forwarding the storehit data, to stall 
the pipeline if said address region logic indicates the load address is within 
one of said plurality of non-cacheable address regions, and to 
subsequently obtain using the load address non-cached correct data fi*om a 
device external to the microprocessor, for provision to the second stage. 

24. (original) The microprocessor of claim 23, fijrther comprising: 

a bus interface unit, for receiving data from a bus coupled to the microprocessor, 
said bus further coupled to a system memory and a plurality of peripheral 
devices; and 

at least one response buffer, operatively coupled to the second stage, for receiving 
load data specified by the load address from said bus interface unit, and 
for providing said load data to the second stage to replace the storehit data 
if the load address is within one of said plurality of non-cacheable address 
regions. 
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25. (original) The microprocessor of claim 23, wherein said plurality of non-cacheable 

regions stored in said address region logic are software-programmable. 

26. (previously presented) A method for forwarding storehit data in a microprocessor 

pipeline, the method comprising: 

storing at least one store instruction result and at least one non-store instruction 

result into a result forwarding cache of the microprocessor; 
detecting a storehit condition, wherein a load instruction in a stage of the pipeline 

specifies data generated by a previous store instruction, wherein said data 

is still present in the pipeline; 
determining whether said data is present in said result forwarding cache; 
selectively forwarding said data from said result forwarding cache to said stage if 

said data is in said result forwarding cache; and 
selectively forwarding said data from a data unit of the microprocessor to said 

stage if said data is not in said result forwarding cache. 

27. (original) The method of claim 26, further comprising: 

storing results data of each store instruction executed by an execution unit of the 
microprocessor in said result forwarding cache. 

28. (original) The method of claim 26, wherein said detecting said storehit condition 

comprises: 

comparing an address of said data specified by said load instruction with a 
plurality of store instruction result data addresses stored in the pipeline 
below said stage; and 

determining said address matches one or more of said plurality of data addresses. 

29. (original) The method of claim 26, wherein said determining whether said data is 

present in said result forwarding cache comprises: 

comparing an address of said data specified by said load instruction with a 
plurality of store instruction result data addresses stored in a 
predetermined number of stages of the pipeline below said stage; 
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wherein said predetermined number equals a number of result entries in said 
result forwarding cache. 

30. (currently amended) A method for speculatively forwarding storehit data in a 

microprocessor pipeline, the method comprising: 

determining that a virtual load address matches ne — a first virtual store 
address[[es]] present in the pipeline to indicate first storehit data is likely 
present in a store buffer of the microprocessor, but does not match a 
second virtual store address present in the pipeline to indicate second 
newer storehit data is likely absent in a result forwarding cache of the 
microprocessor ; 

forwarding the first storehit data fi-om a first stage comprising the store buffer t o a 
second stage of the pipeline having a load instruction specifying the load 
address based on said determining that said virtual load address matches 
no virtual store addresses present in the pipeline the first storehit data is 
likely present in the store buffer and the second newer storehit data is 
likely absent in the result forwarding cache : 

detecting that a physical load address translated from said virtual load address 
matches a physical store address translated from the second virtual store 
address to indicate the second newer storehit data is certainly present in 
the result forwarding cache p resent in the pipeline , subsequent to said 
forwarding said -the first storehit data; and 

stalling the pipeline in response to said detecting that said physical load address 
translated from said virtual load address matches said physical store 
address present in the pipeline , until correct data specified by said physical 
load address is provided to replace the previously forwarded second newer 
storehit data . 

3 1 . (original) The method of claim 30, further comprising: 

forwarding correction data from a third stage of the pipeline to said second stage 

after said stalling the pipeline; and 
unstalling the pipeline after said forwarding said correction data. 
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32. (canceled) 



33. (original) The method of claim 30, wherein said storehit data comprises a store 

instruction result within the pipeline having an identical physical store address as 
said physical load address. 

34. (previously presented) A method for speculatively forwarding storehit data in a 

microprocessor pipeline, the method comprising: 

detecting a storehit condition by comparing a load address with a plurality of store 
addresses; 

forwarding storehit data in response to said detecting said storehit condition; 
determining said load address is within a non-cacheable address region 

subsequent to said speculatively forwarding; and 
stalling the pipeline in response to said determining said load address is within a 

non-cacheable address region. 
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35. (new) A method for speculatively forwarding storehit data in a microprocessor 

pipeline, the method comprising: 

comparing a virtual load address with first and second virtual store addresses, 
wherein a load instruction specifying the virtual load address is newer than 
a first store instruction specifying the first virtual store address, which is 
newer than a second store instruction specifying the second virtual store 
address; 

speculatively forwarding a result of the first store instruction to the load 
instruction, in response to said comparing indicating the virtual load 
address matches the first virtual store address and mismatches the second 
virtual store address; 

comparing a physical load address with a physical store address, wherein the 
physical load address is a translation of the virtual load address, wherein 
the physical store address is a translation of the second virtual store 
address; 

determining said forwarding the result of the first store instruction to the load 
instruction was incorrect, after said speculatively forwarding the result of 
the first store instruction, in response to said comparing indicating the 
physical load address matches the physical store address; and 

forwarding a resuh of the second store instruction to the load instruction, in 
response to said determining. 

36. (new) The method of claim 35, wherein said comparing the virtual load address 

with the first virtual store address comprises determining whether the virtual load 
address hits in a data cache of the microprocessor, wherein said speculatively 
forwarding the result of the first store instruction to the load instruction comprises 
the data cache providing the result of the first store instruction to the load 
instruction. 
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37. (new) The method of claim 35, further comprising: 

stalling the pipeline, in response to said determining, until said forwarding the 
result of the second store instruction to the load instruction. 

38. (new) The method of claim 35, further comprising: 

writing the result of the second store instruction to a data cache of the 
microprocessor, in response to said determining; 

wherein said forwarding the result of the second store instruction to the load 
instruction comprises the data cache providing the result of the second 
store instruction to the load instruction. 

39. (new) The method of claim 35, further comprising: 

reissuing the load instruction, in response to said determining. 

40. (new) The method of claim 39, wherein said reissuing the load instruction 

comprises providing the virtual load address to a data cache of the microprocessor 
from a replay buffer of the microprocessor. 

41. (new) The method of claim 35, wherein said comparing the virtual load address 

with the second virtual store address is performed prior to the result of the second 
store instruction being stored a store buffer of the microprocessor. 

42. (new) The method of claim 35, wherein said speculatively forwarding the result 

of the second store instruction to the load instruction comprises forwarding the 
result of the second store instruction from a result forwarding cache (RFC) of the 
microprocessor. 

43. (new) The method of claim 42, further comprising: 

caching the result of the second store instruction in the RFC, prior to said 
comparing the virtual load address with the second virtual store address. 
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(new) The method of claim 43, further comprising: 

caching a result of a non-store instruction in the RFC, prior to said comparing the 
virtual load address with the second virtual store address. 
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